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Inhibitors of polypeptide GalNAc-transferase lectin domains 
and the functions mediated by these. 



5 - h Technical field 

The present invention relates generally "to the biosynthesis, sorting and 
secretion of mucins, O-glycosylated glycoproteins, and glycoproteins. This invention is 
more particularly related to inhibitors of a homologous family of UDP-N-acetyl-a-D- 
galactosamine: polypeptide W-acetylgaiactosannnyltr^ 
1 0 which add^acetylgalactosamine (GalNAc) to the hydroxy group of serine and 

m threonine amino acid residues in peptides and proteins. This invention is more 

O 

fn particularly related to inhibitors of a lectin domain found in the C-terminal region of 
O - 

W most GalNAc-transferases, which is separate from the catalytic domain of the enzymes. 

This invention is more particularly related to the lectin domains of the GalNAc- 

j 15 transferases, GalNAc-T2, -T3, -T4, and -T7, selective inhibitors of the binding properties 

m 

of these lectin domains, and selective inhibitors of the effects these lectin domains exert 

k- 

p on the functions of GalNAc-transferases 

o 
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2. Background of the invention 

Mucin-type O-glycosylation, one of the most abundant forms of 
protein glycosylate, is found on secreted and cell surface associated glycoproteins of 
all eukaryotic cells except yeast. Mucin-type O-glycans contribute to a number of 
5 important molecular functions, including: direct effects on protein conformation, 
solubility, and stability; specific receptor functions that regulate cell trafficking and 
cell-cell interactions; and microbial clearance. Mucin-type O-glycans are synthesised 
in the Golgi through the sequential addition of saccharide residues, first to hydroxyl - 
groups on serines and threonines of a protein core and subsequently to hydroxyl 
10 groups on the growing saccharide chains that extend from the protein core. There is 
g great diversity in the structures created by O-glycosylation (hundreds of potential 

y 

w structures), which are produced by the catalytic activity of hundreds of 

m glycosyltransferase enzymes that are resident in the Golgi complex. Diversity exists at 

m the level of the glycan structure and in positions of attachment of O-glycans to protein 

Nj 15 backbones. Despite the high degree of potential diversity, it is clear that O- 

glycosylation is a highly regulated process that shows a high degree of conservation 
among multicellular organisms.' 

The factors that regulate the attachment of O-glycans to particular 
protein sites and their extension into specific structures are poorly understood. 
20 Longstanding hypotheses in this area propose that mucin-type O-glycosylation occurs 
in a stochastic manner where structure of acceptor proteins combined with topology 
and kinetic properties of resident Golgi glycosyltransferases determine the order and 
degree of glycosylation (1). This concept does not fully explain the high degree of 
regulation and specialisation that governs the O-glycosylation process. In particular it 
25 is difficult to envision how large mucin molecules with high densities of O-glycans 
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are glycosylated in the Golgi by stochastic mechanisms that also create other sparsely 
glycosylated proteins. 

The first step in mucin-type O-glycosylation is catalysed by one or 
more members of a large family of UDP-GalNAc: pojypeptide N- 
acetylgalactosaminyltransferases (GalNAc-transferases) (EC 2.4.1.41), which transfer 
GalNAc to serine and threonine acceptor sites (2). To date eight members of the 
mammalian GalNAc-transferase family have been identified and characterized, and 
several additional putative members of this gene family have been predicted from 
analysis of genome databases. The GalNAc-transferase isoforms have different 
kinetic properties and show differential expression patterns temporally and spatially, 
suggesting that they have distinct biological functions (2). Sequence analysis of 
GalNAc-transferases have led to the hypothesis that these enzymes contain two 
distinct subunits: a central catalytic unit, and a C-terminal unit with sequence 
similarity to the plant lectin ricin (3-6). Previous experiments involving site-specific 
mutagenesis of selected conserved residues confirmed that mutations in the catalytic 
domain eliminated catalytic activity. In contrast, mutations in the lectin domain had 
no significant effects on catalytic activity of at least one GalNAc-transferase isoform, 
GalNAc-Tl (3). Thus, the C-terminal "lectin domain" is believed not to be functional 
and have roles for .the enzymatic functions of GalNAc-transferases (3). 

Recent evidence demonstrates that some GalNAc-transferases exhibit 
unique activities with partially GalNAc-glycosylated glycopeptides. The catalytic 
actions of two GalNAc-transferase isoforms, GalNAc-T4 and -T7, selectively act on 
" glycopeptides corresponding to mucin tandem repeat domains where only some of the 
clustered potential glycosylation sites have been GalNAc glycosylated by other 
GalNAc-transferases (7-9). GalNAc-T4 and -T7 recognize different GalNAc- 



glycosylated peptides and catalyse transfer of GalNAc to acceptor substrate sites in 
addition to those that were previously utilized. One of the functions of such GalNAc- 
transferase activities is predicted to represent a control step of the density of O-glycan 
Occupancy in mucins and mucin-like glycoproteins with high density of O- 
glycosylation. Thus, the sequential actions of multiple GalNAc-transferase isoforms 
may be required to complete O-glycan attachments to some mucin peptide sequences 
allowing for detailed control of density. 

One example of this is the glycosylation of the cancer-associated 
mucin MUC1. MUC1 contains a tandem repeat O-glycosylated region of 20 residues 
(HGVTSAPDTRPAPGSTAPPA) with five potential O-glycosylation sites. GalNAc- 
Tl, -T2, and -T3 can initiate glycosylation of the MUC1 tandem repeat and 
incorporate at only three sites (HGVTSAPDTRPAPGSTAPPA, GalNAc attachment 
sites underlined). GalNAc-T4 is unique in that it is the only GalNAc-transferase 
isoform identified so far that can complete the O-glycan attachment to all five 
acceptor sites in the 20 amino acid tandem repeat sequence of the breast cancer 
associated mucin, MUC1. GalNAc-T4 transfers GalNAc to at least two sites not used 
by other GalNAc-transferase isoforms on the GalNAc4TAP24 glycopeptide 
(TAPPAHGVTSAPDTRPAPGSTAPP, GalNAc attachment sites underlined) (8). An 
activity such as that exhibited by GalNAc-T4 appears to be required for production of 
the glycoform of MUC1 expressed by cancer, cells where all potential sites are 
glycosylated (10). Normal MUC1 from lactating mammary glands has approximately 
2.6 O-glycans per repeat (1 1) and MUCl derived from the cancer ceil line T47D has 
4. 8 O-glycans per repeat (10). The cancer-associated form of MUC l is therefore 
associated with higher density of O-glycan occupancy and this is accomplished by a 
GalNAc-transferase activity identical to or similar to that of GalNAc-T4. The 



mechanism by which GalNAc-T4 and -T7 recognize and functions with GalNAc- 
glycosylated glycopeptides is not known. 

Glycosylation confers physico-chemical properties including protease 
resistance, solubility, and stability to proteins (12-14), Glycosylation furthermore 
5 confers changes in immunological responses to proteins and glycoproteins. O- 

glycosylation on mucins and mucin-Iike glycoproteins protect these molecules found 
in the extracellular space and body fluids from degradation. Control of O- 
glycosylation with respect to sites and number (density) of O-glycan attachments to . 
proteins as well as control of the O-glycan structures made at specific sites or in 

cn 

O 10 general on glycoproteins, is of interest for several purposes. Diseased cells e.g. cancer 
g cells often dramatically change their O-glycosylation and the altered glycans and 

u glycoproteins may constitute targets for therapeutic and diagnostic measures (15; 16). 

Mucins functioning in body fluids may have different properties depending on density 

O' 

In and structure of O-glycans attached in protection against disease, including infections 

15 by micro-organisms. Furthermore, mucins with different glycosylation may change 
physico-chemical properties including stability and solubility properties that may 
influence turnover and removal of mucous. A number of lung diseases, e.g. cystic 
fibrosis, asthma, chronic bronchitis, smokers lungs, are associated with symptomatic 
mucous accumulation (17-19), and it is likely that the nature and structure of mucins 
20 play a role in the pathogenesis of such diseases. 

Inhibitors of O-glycan processing in cells have been reported. Aryl-A^- 
acetyl-a-galactosaminides such as benzyl-, phenyl-, and /Miitrophenyl-GalNAc were 
originally found to inhibit synthesis of core 1 (Gaip l-3GalNAcal -R) and more 
complex structures (20). Benzyl-GalNAc was also found to inhibit. sialylation (21 ; 
25 22). Treatment of cells with benzyl-GalNAc not only inhibits O-glycan processing but 
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also selectively inhibits apical sorting of some O-glycosylated proteins (23-25). The 
mechanism for this is believed to be through inhibition of sialylation, however, past 
studies of the above mentioned inhibitors have not considered the effect on O-glycan 
occupancy. No inhibitors of O-glycan attachment and polypeptide GalNAc- 
transferases have been described. While inhibition of steps in O-glycan processing 
may alter the structures found on O-linked glycoproteins and the functions that 
specific glycans may exert, it may be expected that inhibition of steps in the process 
determining O-glycan attachments to proteins may have more profound effects on the 
functions, of mucins and glycoproteins in general. Such inhibitors could be directed to 
specific isoforms of polypeptide GalNAc-transferases and hence selectively inhibit O- 
glycan attachments at acceptor sites used by such isoforms. Moreover, such inhibitors 
could be selectively directed to isoforms functioning as follow-up enzymes with 
GalNAc-glycopeptide specificities, and inhibitors could also be directed to any other 
functions polypeptide GalNAc-transferases may have in the secretory pathway and 
intracellular sorting. 

Consequently, there exists a need in the art for methods of inhibiting the 
functions of polypeptide GalNAc-transferases. The present invention meets this need, 
and further presents other related advantages. 

3. Summary of the invention 

The existence of functional lectin domains on polypeptide GalNAc- 
transferase isoforms, GalNAc-T2, -T3, -T4, and -T7, and the functions of these in O- 
glycosylation are disclosed. The GalNAc-transferase isoforms, GalNAc-T4 and -T7, 
function with partially O-glycosylated peptides as follow-up enzymes. The putative 
lectin domains found in the C-terminus of these GalNAc-transferases function as a 



GalNAc lectin and confers the glycopeptides specificity to the enzymes. In a preferred 
embodiment, a method of selectively, blocking the glycopeptide specificities of such 
GalNAc-transferase isoforms by GalNAc and GalNAc containing structures is 
disclosed. 

Lectin domains with diverse sequences are found on other GalNAc- 
transferases, and the lectin domains of GalNAc-T2 and -T3 isoforms are also 
functional and recognize both carbohydrate and peptide sequences. These lectin 
domains may function as chaperones in directing mucin-type O-glycosylation, 
transport and sorting of glycoproteins. In yet a preferred embodiment, methods of 
blocking, the lectin domains of such GalNAc-transferase isoforms by GalNAc and 
GalNAc containing structures as well as peptides are disclosed. 
Inhibitors in accordance with the present invention are useful for changing the density 
and sites of O-glycan occupancy in mucins and O-linked glycoproteins. Further, uses 
are in changing Golgi-transport and intracellular sorting events conferred by the lectin 
domains of GalNAc-transferases. For example, inhibitors of lectin domains of 
GalNAc-transferases may be useful in manipulating disease associated O- 
glycosylation to augment immunity and to prepare vaccines. Further use may be 
found in manipulating mucin secretion and O-glycan density in diseases associated 
with mucous accumulation to decrease secretion and enhance clearance of mucins. 
Further use may entail modulating O-glycosylation of recombinant glycoproteins by 
inhibition of polypeptide GalNAc-transferases in host expression cells. These and 
other aspects of the present invention will become evident upon reference to the 
following detailed description and drawings. 



Figure 1 illustrates that the MUC1 glycopeptide specificity of 
polypeptide GalNAc-T4 is not directed by a specific glycoform. Panel A is a Schematic 
depiction of product development assays monitored by capillary electrophoresis (CE) 
and/or MALDI-TOF mass spectrometry. Left side illustrates MUC1 tandem repeat 
5 peptide glycoforms (open circles indicate attachments of GalNAc) prepared by in 
vitro glycosylation with indicated GalNAc-transferase isoforms. Right side illustrates 
products (closed circles indicate GalNAc residues added) developed in 6 hours by 
GalNAc-T4. Glycopeptides were characterized by mass spectrometry. Panel B is an 
illustration of the reactions with TAP25V21 monitored by capillary electrophoresis, 
0 where GalNAc-Tl and -T4 were mixed. Numbers above peaks refer to numbers of 
moles of GalNAc incorporated into the peptide. 

Figure 2 illustrates that the lectin domain of GalNAc-T4 selectively 
directs its MUC1 glycopeptide specificity. Panel A is a schematic depiction of the 
domain structure of polypeptide GalNAc-transferases modified from Hagen * al. (3). 
Arrows indicate conserved cysteine residues and the major conserved sequence motifs 
are shown with numbering according to the sequence of GalNAc-Tl. Bold underlined 
residues in the catalytic domain indicate some residues required for catalysis, whereas 
the two marked residues in the lectin domain are not essential for catalytic activity of 
GalNAc-Tl (3). A D459H mutation in the lectin domain of GalNAc-T4 corresponds 
to the illustrated D444H in GalNAc-Tl . Panel B is a time-course MALDI-TOF 
(matrix-assisted-laser-desorption-ionization time-of-flight) analysis of the 
glycosylation independent activities of wild-type GalNAc-T4 459D and the lectin 
mutant GalNAc-T4«»" using the unique substrate for this enzyme isoform derived 
from PSGL-1 [Thr in bold is the acceptor site (8)]. The control represents co-purified 
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endogenous activity found with irrelevant expression constructs. Wild-type and 
mutant GalNAc-T4 exhibit identical glycosylation independent activities. Panel C is 
a time-course MALDI-TOF analysis using the unique glycosylation dependent 
substrate GalNAc 3 TAP25 V2 1 (GalNAc attachment sites bold and underlined, and the 
5 two available acceptor sites for GalNAc-T4 in bold). The mutant GalNAc-T4 is 
virtually inactive with the glycppeptide substrate. 

Figure 3 illustrates that the lectin domain of GalNAc-T4 functions as a 
lectin and has selective specificity for GalNAc. Panel A: Inhibition of the 
glycosylation dependent function of GalNAc-T4 by free sugars. Time-course 

m 

q 10 MALDI-TOF analysis of GalNAc-T4 439D in the presence of 0.23 M free sugars, 

g indicate selective inhibition of activity in the presence of GalNAc. Panel B: Time- 

JJ course MALDI-TOF analysis of the glycosylation independent functions of wild-type 

!_ and mutant GalNAc-T4, show that GalNAc has no effect on the general catalytic 

Q £ . 

in function of the enzyme. 

1=* 15 

c 

M 5. Detailed description of the invention 

All patent applications, patents, and literature references cited in this 
specification are hereby incorporated by reference in their entirety. In the case of 
conflict, the present description, including definitions, is intended to control. 

20 

5.1. Definitions : 

1. "Nucleic acid" or "polynucleotide" as used herein refers to purine- and 
pyrimidine-containing polymers of any length, either polyribonucleotides or 
polydeoxyribonucleotides or mixed polyribo-polydeoxyribo nucleotides. This includes 
25 single- and double-stranded molecules, i.e., DNA-DNA, DNA-RNA and RNA-RNA 
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hybrids, as well as "protein nucleic acids" (PNA) formed by conjugating bases to an 
amino acid backbone. This also includes nucleic acids containing modified bases (see 
below). 

2. "Complementary DNA or cDNA" as used herein refers to a DNA 
i molecule or sequence that.has been enzymatically synthesised 'from the sequences 

present in an mRNA template, or a clone of such a DNA molecule. A "DNA Construct" 
is a DNA molecule or a clone of such a molecule, either single- or double-stranded, 
which has been modified to contain segments of DNA that are combined and juxtaposed 
in a manne? that would not otherwise exist in nature. By way of non-limiting example, a 
cDNA or DNA which has no introns is inserted adjacent to, or within, exogenous DNA 
sequences. 

3, A plasmid or, more generally, a vector, is a DNA construct containing 
genetic information that may provide for its replication when inserted into a host cell. A 
Plasmid generally contains at least one gene sequence to be expressed in the host cell, as 
well as sequences that facilitate such gene expression, including promoters and 
transcription initiation sites. It may be a linear or closed circular molecule. 

4. Nucleic acids are "hybridizable" to each other when at least one strand 
of one nucleic acid can anneal to another nucleic acid under defined stringency 
conditions. Stringency of hybridization is determined, e.g., by a) the temperature at 
which hybridization and/or washing is performed, and b) the ionic strength and polarity 
(e.g, formamide) of the hybridization and washing solutions, as well as other 
parameters. Hybridization requires that the two nucleic acids contain substantially 
complementary sequences; . depending on the stringency Of hybridization, however, ' 
mismatches may be tolerated. Typically, hybridization of two sequences at high 
stringency (such as, for example, in an aqueous solution of 0.5X SSC, at 65°C) requires 
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that the sequences exhibit some high degree of complementarity over their entire 
sequence. Conditions of intermediate stringency (such as, for example, an aqueous 
solution of 2X SSC at 65°C) and low stringency (such as, for example, an aqueous 
solution of 2X SSC at 55°Q, require correspondingly.^ overall complementarily 
between the hybridising sequences. (IX SSC is 0.15 M NaCl, 0.015 M Na citrate.) 

5. An "isolated" nucleic acid or polypeptide as used herein refers to a 
component that is removed from its original environment (for example, its natural 
environment if it is naturally occurring). An isolated nucleic acid or polypeptide 
contains less than about 50%, preferably less than about 75%, and most preferably less 
than about 90%, of the cellular components with which it was originally associated. 

6. A "probe" refers to a nucleic acid that forms a hybrid structure with a 
sequence in a target region due to complementarily of at least one sequence in the probe 
with a sequence in the target region. 

7. A nucleic acid that is "derived from" a designated sequence refers to a 
nucleic acid sequence that corresponds to a region of the designated sequence. This 
encompasses sequences that are homologous or complementary to the sequence, as well 
as "sequence-conservative variants" and "function-conservative variants". Sequence- 
conservative variants are those in which a change of one or more nucleotides in a given 
codon position results in no alteration in the amino acid encoded at that position. 
Function-conservative variants of polypeptide GalNAc-transferases are those in which a 
given amino acid residue in the polypeptide has been changed without altering the 
overall confoimation and enzymatic activity (including substrate specificity) of the 
native polypeptide; these changes include, but are not limited to, replacement of an 
amino acid with one having similar physico-chemical properties (such as, for example, 
acidic, basic, hydrophobic, and the like). 
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8. A "donor substrate" is a molecule recognised by, e.g., a polypeptide 
GalNAc-transferees and that contributes a ^-acetylgalactosamine moiety for the 
transferase reaction. For polypeptide GalNAc-transferases, a donor substrate is UDP-N- 
acetylgalactosamine or with some GalNAc-transferase isoforms UDP-galactose. An 
"acceptor substrate" is a molecule, preferably a peptide, protein, glycopeptide, and 
glycoproteins, that is recognised by, e.g., a polypeptide GalNAc.transferase and that is 
the target for the modification catalysed by the transferase, i.e., receives the carbohydrate 
moiety. For polypeptide GalNAc-transferases, acceptor substrates include without 
limitation peptides, proteins, glycopeptides, and glycoproteins. 

9. The A^-acetylgalactosaminyltransferase T4 (GalNAc-T4) gene has 
been isolated from a human salivary gland library as described previously (8). The 
sequence of the GaINAc-T4 nucleic acid so isolated and the sequence of the encoded 
GalNAc-T4 polypeptide have been submitted to GenBank/EBI Data Bank and . 
assigned accession number Y08564. 

10. The AT-acetylgalactosaminyltransferase T7 (GalNAc-T7) gene has 
been isolated from a human gastric carcinoma cell line MKN45 library as described 
previously (7). The sequence of the GalNAc-T7 nucleic acid so isolated and the 
sequence of the encoded GalNAc-T7 polypeptide have been submitted to 
GenBank/EBI Data Bank and assigned accession number AJ002744. 

1 1. The N-acetylgalactosaminyltransferase T2 (GalNAc-T2) gene has 
been isolated from a human gastric carcinoma ceil line MKN45 library as described 
previously (26). The sequence of the GalNAc-T2 nucleic acid so isolated and the 

sequence of the encoded GalNAc-T2 polypeptide have been submitted to " 

GenBank/EBI Data Bank and assigned accession number X85019. 
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12. The A^-acetylgalactosaminyltransferase T3 (GalNAc-T3) gene has 
been isolated from a human salivary gland library as described previously (27). The 
sequence of the GalNAc-T3 nucleic acid so isolated and the sequence of the encoded 
GalNAc-T3 polypeptide have been submitted to GenBank/EBI Data Bank and 
assigned accession number X92689. 

Expression to produce enzymatically-active GalNAc-T4, -T7, -T2, and 
-T3 can be carried out in any number of conventional expression systems familiar to 
those skilled in the art. In one embodiment, GalNAc-transferases are expressed in a 
soluble form which can be recovered from the culture medium. In another 
embodiment, host cells (e.g. CHO cells) are engineered to express GalNAc- 
transferases and glycosylate substrates in vivo in host cells. 

In accordance with one embodiment of the method of the invention, 
enzymatically active GalNAc-transferases are contacted with an acceptor substrate 
and an ^acetylgalactosamine donor substrate, preferably UDP-JV- 
acetylgalactosamine, under conditions for transfer of ^-acetylgalactosamine from the 
donor substrate to the acceptor substrate. Glycosylated acceptor substrate is then 
obtained. Preferred acceptor substrates are proteins, peptides, glycoproteins, and 
glycopeptides. Particularly preferred acceptor substrates for GalNAc-T4 are GalNAc- 
glycosylated glycopeptides from MUC1, MUC2, and MUC5AC tandem repeats or 
multimers of those molecules. Particularly preferred acceptor substrates for GalNAc- 
T7 are GalNAc-glycosylated glycopeptides from MUC2 and rat submaxillary gland 
mucin tandem repeats or multimers of those molecules. Particularly preferred 
acceptor substrates for GalNAc-T2 are peptides from MUC1, MUC2, MUC5AC and 
MUC7 tandem repeats or multimers of those molecules. Particularly preferred 
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acceptor substrates for GalNAc-T3 are peptides from MUC1, MUC2, MUC5AC and 
. MUC7 tandem repeats or multimers of those molecules. 

Transfer assays for carrying out glycosylation are familiar to those in 
the art, and are described in the literature cited above and in the examples provided 
below. 

As noted, human GalNAc-T4 demonstrates unique acceptor substrate 
specificity. GalNAc-T4 has been found to transfer GalNAc to two sites in the MUC1 
tandem repeat sequence: Ser in GVTSA and Thr in PDTR using a 24-mer 
glycopeptide with GalNAc residues attached at sites utilized by GalNAc-Tl, -T2 and 
T3 (TAPPAHGVTS APDTRPAPGSTAPPA, wherein the GalNAc sites are 
underlined) (8). In an important aspect of the invention, the action of GalNAc-T4 is 
dependent on prior GalNAc attachments at least at one site of the five acceptor sites in 
the MUC1 tandem repeat. In another important aspect of the invention this activity is 
dependent on a lectin domain constituting approximately 130 amino acid residues in 
the C-terminal region of GalNAc-T4. In yet an important embodiment of the 
invention this activity can be blocked by GalNAc and GalNAc containing compounds 
such as benzyl-GalNAc. 

As noted, human GalNAc-T7 demonstrates unique acceptor substrate 
specificity. GalNAc-T7 has only been found to transfer to acceptor substrates which 
have previously been partially GalNAc-glycosylated (7; 9). A preferred acceptor 
substrate is derived from MUC2, MUC5AC and rat submaxillary gland mucin tandem 
repeats. In an important embodiment of the invention the activity of GalNAc-T7 can 
be blocked by.GalNAc and GalNAc containing compounds such as benzyl-GalNAc, 
and Gal and Gal containing compounds such as benzyl-Gal and Gaipi-3GalNAcal- 
benzyl . 
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Human GaINAc-T2 demonstrates unique UDP-Gal donor substrate 
specificity with MUC2 peptide substrate (28). In an important embodiment of the 
invention the activity of GalNAc-T2 with UDP-Gal can be blocked by GalNAc and 
GalNAc containing compounds such as benzyl-GalNAc. 

Human GalNAc-T3 demonstrates unique UDP-Gal donor substrate 
specificity with rat submaxillary gland mucin peptide substrate. In an important 
embodiment of the invention the activity of GalNAc-T3 with UDP-Gal can be 
blocked by GalNAc and GalNAc containing compounds such as benzyl-GalNAc. 

The use of inhibitors of the lectin domain mediated activities of the 
above polypeptide GalNAc-transferase isoforms and other isoforms allows for 
selective inhibition of these functions in vitro and in vivo in cells and organisms. This 
is advantages in manipulating the density of O-glycans, e.g. changing high density O- 
glycosylated tumour-associated MUC1 to low density normal MUC1 in cells. Further 
this is advantages for inhibiting any adhesive role the lectin domains may play in 
Golgi transport and intracellular sorting. 

5.2. The MUC1 glycop eptide specificity of GalNAc-T4 is not directed by a 

specific glycoform. 

The GalNAc-T4 isoform displays enzyme activity which, in addition to 
showing activity with some peptide substrates, exhibits unique activity with 
glycopeptides where prior glycosylation is a prerequisite for activity (8). GalNAc-T4 
is unique in that it is the only GalNAc-transferase isoform identified so far that can 
complete the O-glycan attachment to all five acceptor sites in the 20 amino acid 
tandem repeat sequence (HGVTSAPDTRPAPGSTAPPA) of the breast cancer 
associated mucin, MUC1. GalNAc-T4 transfers GalNAc to at least two sites not used 

15 



by other GalNAc-transferase isoforms on the GalNAc 4 TAP24 glycopeptide 
(TAPPAHGVTSAPDTRPAPGSTAPP, GalNAc attachment sites underlined) (8). An 
activity such as that exhibited by GalNAc-T4 appears to be required for production of 
the glycoform of MUC1 expressed by cancer cells where all potential sites are 
glycosylated (10). In order to analyse activity of GalNAc-T4 with MUC1 derived 
GalNAc-peptides in detail different glycoforms of 24/25-mer peptides (TAP24/25) by 
using different GalNAc-transfera,se isoforms to catalyse glycosylation of selected sites 
in combination with valine substitutions of acceptor sites were prepared. Analysis of 
the substrate specificity of GalNAc-T4 with different glycoforms of MUC1 revealed 
that GalNAc-T4 did not show a requirement for any single site of GalNAc attachment 
(Fig. 1); however, there was a requirement for at least one of the three sites to be 
glycosylated. Thus, substitution of any one of the sites glycosylated in the 
GalNAc4TAP24/25 glycopeptide by valine did not affect activation of GalNAc-T4 
activity for glycopeptides. Catalytic activity with certain sites was affected by site 
specific modifications, in particular glycosylation of S in -VTSA- or -GSTA- was 
influenced by glycosylation at adjacent and distant sites. Nevertheless, this result 
suggests that there is a triggering of GalNAc-T4 activity in the presence of 
glycosylated MUC1 substrate that can not be ascribed to simple conformational 
changes in the acceptor substrate induced by the glycosylation. This led us to 
hypothesise that a triggering event that was independent of the general catalytic 
activity of the enzyme led to acquisition of specificity for GalNAc-glycopeptides. A 
likely candidate for the triggering event of glycopeptide activity was the putative 
lectin domain, which was previously. shown by mutational analysis to not significantly 
affect the activity of GalNAc-Tl with a peptide substrate (3). 
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5.3. The lectin domain of G alNAc-T4 selectively directs its MUC1 glycopep tide 

specificity. 

Since GalNAc-T4 exhibits both glycosylation independent and 
glycosylation dependent activities, it offers a modej system to analyse the different 
5 specificities as separate functions. Hagen et al (3) originally demonstrated that 

critical substitutions in the lectin domain of GalNAc-Tl have little affect on catalytic 
activity (reduction by 10-50%) with peptide substrates, while substitutions in the 
catalytic domain destroyed activity (Fig. 2, Panel A). It was predicted that mutation of 
an aspartate residue adjacent to a conserved CLD motif in the lectin domain to 
10 histiduie p444H in GalNAc-Tl corresponding to D459H in GalNAc-T4) would 
destroy putative lectin function based on analysis of ricin (29), but mutation of this 
y residue (D444H) in GalNAc-Tl only appeared to reduce activity by approximately 50 

iy • 

H %. To test if the lectin domain influenced glycopeptide specificity of GalNAc.-T4, 

5 

Q recombinant secreted forms of GalNAc-T4 459D and -T4 459H were prepared. GalNAc- 

r- j 5 X4 459D and -T4 459H exhibited essentially the same specific activity with several 
g unglycosylated peptides, in agreement with the results obtained for GalNAc-Tl (3) 

(illustrated for a PSGL-1 substrate in Fig. 2, Panel B). In contrast, the glycopeptide 
specificity of mutant GalNAc-T4 459H was selectively affected by the introduced 
mutations. Glycopeptides derived from tandem repeats of MUC1, MUC2 and 
20 MUC5AC (7) were virtually inactive as substrates, as is illustrated in Figure 2 (Panel 
C), which depicts assays with a GalNAc 3 TAP25V21 glycopeptide. Essentially 
identical results were observed with unsubstituted TAP24 and GalNAc4TAP24 
glycopeptide. These results show that the lectin domain. is required for the 
glycopeptide specificity of enzyme activity, but not for activity with naked peptide 
25 substrates. This demonstrates that the lectin domain triggers the catalytic domain of 
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GalNAc-T4 to act on GalNAc-glycopeptide substrates by an as yet unknown 
mechanism. Furthermore, it demonstrates that the basic catalytic function and the 
triggering event are independent properties associated with distinct domains of 
GalNAc-transferases. 

5 - 4 - The lectin domain of GalNAc-T4 functions as a lectin and has selective 
specificity for GalNAc. 
In order to determine if actual carbohydrate binding contributed to the 
function of the lectin domain, we analysed whether triggering of glycopeptide 
specificity could be blocked by specific carbohydrates in solution. We could not 
detect direct binding of GalNAc-T2 and -T4 to free GalNAc using conventional 
binding assays. However, as shown in Figure 3 (Panel A) the glycosylation dependent 
specificity of GalNAc-T4 was almost completely inhibited by incubation with 0.23 M 
free GalNAc, whereas other sugars, Gal, GlcNAc, or Fuc, failed to show significant 
inhibition. Assays with 50 mM sugars gave the same pattern, but with less 
(approximately 50 %) inhibition by GalNAc (not shown). Furthermore, similar 
inhibition was found with 10 mM a-D-GalNAc- 1 -benzyl, whereas otGlcNAc-benzyl 
did not inhibit catalytic activity. None of the sugars had significant affects on the 
glycosylation independent activities of GalNAc-T4 459D or -T4 459H , when assayed with 
naked peptides (Fig 3, Panel B). This demonstrates that the lectin domain of GalNAc- 
T4 binds to GalNAc and contributes to the ability of GalNAc T4 to catalyse 
glycosylation of glycopeptides. The finding that neither Gal nor Galpl-3GalNAcal- 
benzyl produced significant inhibition compared to GalNAc suggests that the second 
step of O-glycosylation (extension of the oligosaccharide side chains), which is 
catalysed by the P3galactosyltransferase forming the core 1 structure Gaipi- 
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3GalNAcal-0-Ser/Thr, may block the functional activity of the lectin domain of 
GalNAc-T4. Thus, once the O-glycan processing step involving elongation to the core 
1 structure is accomplished, GalNAc-T4 would not be capable of catalysing 
glycosylation of glycopeptides. This suggests that O-glycan elongation/branching and 
O-glycan density may be regulated by competition among GalNAc-transferases 
(lectin domain) and the glycosyltransferases involved in O-glycan extension, 
especially the core 1 synthase p3Gal-transferase. 

5.5. The lectin domain of GalNAc-T7 functions as a lectin and has selective 
specificity for GalNAc and GalBl-3GaINAc. 

GalNAc-T7 exhibits exclusive glycopeptide specificity and no 
unsubstituted acceptor peptide substrates have been identified thus far (7). GalNAc- 
T7 has a different glycopeptide substrate specificity than GalNAc-T4 and does not 
function with MUC1 derived glycopeptides. The best substrate identified to date is 
derived from the tandem repeat region of rat submaxillary gland mucin (30). The 
activity of GalNAc-T7 with GalNAc 2 . 3 EA2 was significantly inhibited by benzyl- 
aGalNACi benzyl-pGal, and the Gaipi-3GalNAcal -benzyl disaccharide core 1 
structure at 5 mM concentrations (Table I). 

Table I Inhibition of GalNAc-T7 activity with GalNAc 2 . 3 EA2 substrate 

Activity (nmol/min/rnl) in the presence of inhibitors (5 mM) 
None bz-aMan bz-pGal bz-aGalNAc Galpl-3GalNAcctl-bz 

6.8 6.7 4.7 5.4 4.5. 

5.5. The lectin domain of GalNAc-T2 is functional and has selective specificity 
for GalNAc and the MUC2 and MUC5AC tandem repeat peptides. 
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GalNAc-T2 exhibits galactosyltransferase activity in the presence of 
the Muc2 acceptor substrate (28). Furthermore, testing a panel of peptide substrates it 
was found that GalNAc-T2 also utilized Muc7 and to lesser degree the EA2 peptide in 
the presence of UDP-Gal (GalNAc-T2 activity with UDP-Gal: Muc2, 90 
nmol/min/ml; Muc7, 13 nmol/min/ml; EA2, 1.5 nmol/min/ml). The 
galactosyltransferase activity with Muc2 substrate was selectively inhibited by 
GalNAc and not other sugars. 

Table II Inhibition of GalNAc-T2 activities with Muc2 acceptor substrate 
Donor Substrate Activity (nmol/min/ml) in the presence of inhibitors (230 mM) 

~~~~~ None GalNAc GlcNAc Gal Fuc 

UDP-GalNAc 340 3(H) MO 300 370 

UDP-Gal 90 24 68 89 89 

Since the galactosyltransferase activity exhibited by GaINAc-T2 exhibits an entirely 
different acceptor substrate pattern than the N-acetylgalactosaminyitransferase 
activity, it is concluded that the lectin domain exhibits peptide binding specificity in 
addition to GalNAc. Hence, the mechanism of activation resemble that of the 
glycopeptide specificity of GalNAc-T4 only the trigger is a peptide sequence motif 
comprised in the Muc2 and Muc7 peptide sequences. 

5.5. The lectin domain of GalNAc-T3 is functional and has selective specificity 
for GalNAc and the MUC5AC and rat submaxillary tandem repeat peptides. 

GalNAc-T3 was found also to exhibit galactosyltransferase activity but 
only in the presence of the EA2 acceptor substrate (GalNAc-T2 activity with UDP- 
Gal: Muc2, 0 nmol/min/ml; Muc7, 0.1 nmol/min/ml; EA2, 6.8 nmol/min/ml). The 
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galactosyltransferase activity with Muc2 substrate was selectively inhibited by 
GalNAc and not other sugars. 



Table III Inhibition of GalNAc-T3 activities with EA2 acceptor substrate 



Donor Substrate 


Activity (nmol/min/ml) in the presence of inhibitors (230 mM) 




None 


GalNAc 


GlcNAc 


Gal Fuc 


UDP-GalNAc 


34 


35 


32 


33 34 


UDP-Gal 


6.3 


2.5 


7 


7.3 ' 6.5 



The lectin domain of GalNAc-T3 resemble that of GalNAc-T2 in binding to peptide 
sequences although the sequence motif must be different and partly contained in the EA2 
sequence. 

These and other embodiments of the present invention are described in 
more detail below. The following examples are intended to further illustrate the 
invention without limiting its scope. 

6. Examples 

Standard reaction mixtures (50 u.1 final volume) contained 25 mM cocadylate 
(pH 7.4), 10 mM MnCl 2 , 0.25% Triton X-100, 200 uM UDP-[ ,4 C]-GalNAc (2,000 
cpm/nmol)(Amersham), 200-500 uM acceptor peptides. Products were quantified by 
scintillation counting after chromatography on Dowex-1, octadecyl silica cartridges 
(Bakerbond), or HPLC (PC3.2/3 or mRPC C2/C18 SC2.1/10 Pharmacia, Smart System). 
Acceptor peptides included five variants of TAP25 
(TAPPAHGV(TAV)SAPDTRPAPG(SA^)(T/V)APPA) and TAP24 

(TAPPAHGVTSAPDTRPAPGSTAPP) derived from the human MUC1 tandem repeat (31); 
MUC2 (PTTTPISTTTMVTPTPTPTC) derived from human intestinal mucin MUC2 (32); 
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MUC5 AC (Ac-SAPTTSTTSAPT) derived from human respiratory gland mucin MUC5AC 
(33); MUC7 (Ac-CPPTPSATTPAPPSSSAPPETTAA) derived from human salivary gland 
mucin MUC7 (34); EA2 (PTTDSTTPAPTTK) derived from rat submandibular gland mucin 
(30); VTHPGY (Ac-PFVTHPGY) derived from human fibronectin (35); Zonadhesin 
(PTERTTTPTKRTTTPTIR) derived from human zonadhesin (36); OSM fragment 
(LSESTTQLPGGGPGCA) derived from ovine submaxillary mucin (37); hCG-P 
(PRFQDSSSSKAPPPLPSPSRLPG) derived from human chorionic gonadotropin P-subunit 
(38); MUClb (RPAPGSTAPPA) derived from MUC1 and PSGL-lb (Ac- 
QATEY^YLDYDFLPETEPPEM) derived from the iV-terminus of P-selectin ligand-1 (39) 
GalNAc-glycopeptides of MUC2, MUC5AC and MUC7 were produced using cold UDP T 
GalNAc and purified human recombinant GalNAc-Tl and -T2 (28). Different GalNAc- 
glycoforms of EA2 were produced by limiting the ratio of UDP-GalNAc to 2 moles, 3 moles, 
4 moles or 5 moles per mole of acceptor peptide. Glycopeptides were purified on Supelclean 
LC-18 columns (1 ml, Supelco), and the number of GalNAc residues incorporated evaluated 
by MALDI-TOF mass spectrometry. The enzyme sources used were semipurified as 
previously described by successive sequential ion-exchange chromatographies on Amberlite 
(IRA95, Sigma) or DEAE Sephacel (Pharmacia), S-Sepharose Fast Flow (Pharmacia), and 
Mini-S~(PC 3.2/3, Pharmacia) using the Smart System (Pharmacia) (28). Secreted GalNAc- 
T4 was obtained from a stably transfected CHO line (CHO/GalNA-T4/2 1 A) (8) grown in 
roller bottles in HAMS F12 supplemented with 10 % Fetal Bovine Serum. Experiments 
illustrated in Fig. 1 was performed with recombinant secreted GalNAc-T4 obtained 
from a stably transfected CHO line (8). Experiments illustrated in Figs. 2 and 3 were 
performed with secreted GalNAc-T4 expressed in High Five cells grown in serum- 
free medium (8). Structural analysis of glycopeptides were performed by a . 
combination oTPFPA (pentafluoropropionic acid) hydrolysis and MALDI-TOF mass 
spectrometry as previously described (40). Secreted GalNAc-T7 was obtained from 
infected High Five™ cells grown in serum-free medium (Invitrogen) in upright roller bottles 
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shaken 140 rpm in waterbaths at 27°C. GalNAc-T7 was not purified by Mini-S as the yields 
from cationic chromatography were low due to its low pi (6.4). 

6.1. Reaction kinetics monitored by Capillary Electrophoresis. 

Reaction mixtures were modified to include 1.7 mM cold UDP- 
GalNAc, 25 j^g acceptor peptides, and purified GalNAc-transferases in a final volume 
of 100 ul. The amount of GalNAc-transferase added was adjusted so that the reaction 
with the^appropriate peptide was near completion in six hours. Reactions were 
incubated in the sample carousel of an Applied Bidsystem model HT270 at 30°C as 
described previously (28). Electrophoretograms were produced every 60 min, and 
after six hours the reaction mixtures were separated by reverse phase HPLC for 
structural determination. HPLC was performed on a Brawnlee ODS column (2. 1 mm 
x 30 mm, 5 jam particle size) (Applied Biosystems, Inc.) using a linear gradient 
(0-30%, 0.1% TF A/0.08% TFA, 90% acetonitrile, 30 min) delivered by an ABI 130A 
micro-bore HPLC system (Perkin Elmer Inc). 

6.2 Structural analysis of reaction products. 

Glycopeptides were purified by HPLC and analysed by a combination 
of PFPA (pentafluoropropionic acid, Sigma) hydrolysis and MALDI-TOF mass 
spectrometry. Glycopeptides (50 pmol) were lyophilized in 500 \x\ Eppendorf vials 
and placed in a 22 ml glass vial with a mminert valve (Pierce, Rockford, IL). A 
solution of 100 u.1 20% PFPA (aqueous) containing 500 jig DTT was added to the 
bottom of the glass vial, which was then flushed with argon. The vial was evacuated 
to 1 mbar, and placed in an oven at 90°C for 60 min. The hydrolyzed samples were 
centrifuged in a vacuum centrifuge for 15 min to remove remaining traces of acid. 
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Lyophilized samples were reconstituted in 0.1% TFA to a concentration of 1 pmol/ul. 
Mass spectra were acquired on either Voyager-DE or Voyager-Elite mass 
spectrometers equipped with delayed extraction (Perseptive Biosystem Inc.). The 
matrix used was 2,5-dihydroxybenzoic acid (10 mg/ml, Hewlett-Packard) dissolved in 
5 a 2: 1 mixture of 0. 1% trifluoroacetic acid in 30 % aqueous acetonitrile (Rathburn 
Ltd.). Samples dissolved in 0.1% trifluoroacetic acid to a concentration of 
approximately 80 fmol to 1 pmol/ml were prepared for analysis by placing 1 ul of 
sample^olution on a probe tip followed by 1 ul of matrix. The hydrolyzed samples 
were prepared for MALDI analysis using nano-scale reversed-phase columns (Poros 
q 10 R3, PerSeptive Biosystem), according to previously described procedure (41). 

m 

P Samples were prepared by mixing 0.8 |il of total fraction volume 2 pmol of 

w • 
w 

UJ 
M 

iji Biosy stems Inc., Framingham, Ma, USA) equipped with delayed ion extraction 

u 

15 technology. Data processing was performed using software packages Perseptive- 
Q ' . 

O Grams (Galactic Industries Corp.) and protein analysis software GPMAW 

(htpp://www.welcome.to/gpmaw; Lighthouse data, Odense, Denmark) 



hydrolyzed glycopeptides and 0.4 ul of matrix solution. Mass spectra were acquired 
in reflector mode on a Voyager-Elite Biospectrometry Workstation (PerSeptive 



6.3. Reaction kinetics monitored by mass spectrometry. 

20 MALDI-TOF time-course in terminal reactions were performed in 

reactions of 25 ul containing 2.5 nmol acceptor (glyco)peptide, 40 nmol UDP- 
GalNAc, and 0.4 |ig GalNAc-T4. Sampling of reactions (1 ul) were purified by nano- 
scale reversed-phase chromatography (Poros R3, PerSeptive Biosystem) and applied 
directly to the probe with matrix (41). The amount of GalNAc-transferase added was 

25 adjusted so that the reaction with the appropriate peptide was near completion in six 
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hours. Reactions were incubated at 37°C in a shaker bath. At times 0, 2 hours, and 16 
hours a 1 u.1 aliquot was taken and purified. Mass spectra were acquired on either 
Voyager-DE mass spectrometer equipped with delayed extraction (Perseptive 
Biosystem Inc.). The matrix used was 2,5 -dihydroxybenzoic acid (10 mg/ml, Hewlett- 
Packard) dissolved in a 2:1 mixture of 0.1% trifluoroacetic acid in 30 % aqueous 
acetonitrile (Rathburn Ltd.). 

^ ^Construction, expression, purification, and analysis of a lectin domain 
mutant of GalNAc-T4. 

The mutant GalNAc-T4 459H was prepared by multiplex PCR using the 
GalNAc-T4-sol construct that encodes residues 32-578 inserted into pT7T3U19 (8). 
Primers EBHC332 (5 '-GT AG AGGG ATCTCGTCTG AATGTTT AC ATTATA-3 ' 
(mutation underlined in bold)) and T7 (5'-TAATACGACTCACTATAGGG-3) were 
used in a standard reaction under the following cycling conditions; 95°C 45sec, 51°C 
5sec, 72°C lmin. For 18 cycles using a Tc 2400 thermocycler (PE Biosystems, 
USA). The PCR product was digested with BstYl gel purified. 5 ng hereof was mixed 
with 10 ng pAcGP67-GalNAc-T4sol (8), and the mixture used to prime a "shuffle 
PCR" reaction using primers T7/EBHC201 (5'AAGCGGGCACCATATGCTCG-3'), 
using standard conditions and the following cycling conditions 95°C 45sec, 51°C 
5sec, 72°C lmin. (5 cycles without primers, after which primers were added and the 
reactions was cycled for an additional 17 cycles). The generated PCR product was 
digested with HindUI and inserted into HindUl digested GalNAc-T4 pT7T3Ul 9 
construct described above. The mutated T4- oenstruct was fully sequenced and the 
BamHl insert subcloned into pAcGP67. 
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Wild-type and mutant constructs expressed in insect cells were secreted in 
comparable yields, and the purified proteins migrated by SDS-PAGE identically. 
Quantification of purified proteins was done by Coomassie stained SDS-PAGE and 
titration of immunoreactivity with the monoclonal-antibody, UH6(4G2) (8). GalNAc- 
5 T4 459D and -T4 459H were purified to 0.04 \xg/\x\ and 0.1 ^ig/u.1 with specific activities of 
0.197 U/mg and 0.24 U/mg with a MUC7 tandem repeat derived peptide (7), 
respectively. Wild-type GalNAc-T4 and mutant GalNAc-T4, GalNAc-T4 459H , were 
analyseaVwith unglycosylated peptides (represented by PSGL-1) or GalNAc 
glycosylated glycopeptides (represented by GalNAc 3 TAP25V21) in reactions of 25 |il 

10 containing 2.5 nmol acceptor (glyco)peptide, 40 nmol UDP-GalNAc, and 0.4 [ig 
GalNAc-T4. Time-course assays were motioned by MALDI-TOF. Sampling of 
reactions (1 ul) were purified by nano-scale re versed-phase chromatography (Poros 
R3, PerSeptive Biosystem) and applied directly to the probe with matrix. Evaluation of 
inhibition of the glycopeptide specificity of wild-type GalNAc -T4 with free sugars was 

15 performed to establish if the lectin domain recognized carbohydrate. Analysis was performed 
as above with 0.23 M free GalNAc, Gal, GlcNAc, or Fuc, and the reaction was monitored by 
MALDI-TOF. Further, analysis was performed with 10 mMa-D-GalNAc-1 -benzyl, 
aGlcNAc-benzyl, and fully occupied GalNAc-glycopeptide, GalNAc 6 TAP25, at 5 mM. 

20 6.5 Inhibition of the GalNAc-glycopeptide activity of GalNAc-T7. 

GalNAc-T7 activity was analysed with GalNAc glycosylated glycopeptides 
(represented by GalNAc2oEA2 (7)) in reactions of 25 ul containing 2.5 nmol acceptor 
(glyco)peptide, 40 nmol UDP-GalNAc, and purified GalNAc-T7. Assays were 
performed with 0.23 M free GalNAc, Gal, GlcNAc, or Fuc, and the reaction product 
25 quantified by Dowex-1 chromatography and scintillation counting. Further, analysis was 
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performed with 10 mMa-D-GaINAc-1 -benzyl, aGlcNAc-benzyl, and fully occupied 
GalNAc-glycopeptide, GalNAc 6 TAP25 , at 5 mM. 

6.6. Inhibition of lectin domains of GalN Ac-trans ferases, GalNAc-T2 and -T3, 
that do not exhibit glycopeptide specificities, 

GalNAc-T2 exhibits activity with UDP-Gal in the presence of the acceptor substrate 
Muc2 (28). Galactosyl transferring activities of GalNAc-Tl, -T2, and -T3, were assayed 
with a panel of acceptor peptides in standard reaction mixtures containing 100 uM UDP- 
Gal instead of UDP-GalNAc. GalNAc-T2 showed activity with Muc2 as well as low 
activity with Muc7 and very low activity with EA2 acceptor substrates. GalNAc-T3 
showed activity with EA2 and lower activity with Muc7, but no activity with other 
peptides tested. Since the activities with UDP-Gal do not correlate with the general 
acceptor substrate specificities of these GalNAc-transferase isoforms found with the 
UDP-GalNAc donor substrate, it was tested if the lectin domains were involved. This 
was done by analysing if free sugars could selectively inhibit the activities with UDP- 
Gal and not UDP-GalNAc. Assays were performed with 0.23 M free GalNAc, Gal, 
GlcNAc, or Fuc, and the reaction product quantified by Dowex-1 chromatography and 
scintillation counting. Further, analysis was performed with 10 mMa-D-GalNAc-1 -benzyl, 
aGlcNAc-benzyl, and fully occupied GalNAc-glycopeptide, GalNAc 6 TAP25, at 5 mM. 

What is claimed is: 

1 . A method of inhibiting or modulating functions mediated by lectin 
domains of polypeptide GalNAc-transferase comprising administering an effective 
amount of appropriate carbohydrate, peptide, or glycopeptide portion or fragments 
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which is effective in binding to one or more lectin domains of polypeptide GalNAc- 
transferases and inhibiting functions mediated by said lectin domains. 

2. The method of claim 1 where the inhibited function is GalNAc- 
glycopeptide specificity of polypeptide GalNAc-transferases. 

3 . The method of claim 1 where the inhibited function is Gaip 1 - 
3 GalN^c-glycopeptide specificity of polypeptide GalNAc-transferases. 

4. The method of claim 1 where the inhibited function is UDP-Gal donor 
substrate specificity. 

5. The method of claim 1 wherein the carbohydrate portion comprises a 
GalNAc residue. 

6. The method of claim 1 wherein the carbohydrate portion comprises a 
Gal residue. 



7. The method of claim 1 wherein the carbohydrate portion comprises the 
Gaipi-3GalNAc disaccharide. 



8. The method of claim 1 wherein the carbohydrate portion is linked to an 
acceptable carrier. . 



9. The method of claim 5 wherein said carrier is a benzyl group. 
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10. The method of claim 1 wherein the glycopeptide comprises GalNAc- 
glycosylated glycopeptides. 



1 1 . The method of claim 1 wherein the glycopeptide comprises Gaip 1 - 
3GalNAc-glycosylated glycopeptides. 



^ .12. The method of claim 1 wherein the peptide represents sequences 
contained in the tandem repeats of human and animal mucins. 



13. The method of claim 1 wherein the inhibitor is linked to a 
pharmaceutical carrier. 

14. The method of claim 1 wherein the polypeptide GalNAc-transferase is 
GalNAc-T4. 

15. The method of claim 1 wherein the polypeptide GalNAc-transferase is 
GalNAc-T7. 

1 6. The method of claim 1 wherein the polypeptide GalNAc-transferase is 
GalNAc-T2. 

"17. The method of claim 1 wherein the polypeptide GalNAc-transferase is 
GalNAc-T3. 
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18. The method of claim 1 wherein the lectin mediated functions is 
glycopeptide specificity of polypeptide GalNAc-transferases. 



19. The method of claim 1 wherein the lectin mediated functions is peptide 
specificity of polypeptide GalNAc-transferases. 
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Abstract of the invention 

Attachment of O-glycans to proteins is controlled by a large family of 
homologous polypeptide GalNAc-transferases. Polypeptide GalNAc-transferases contain a 
C-terminal sequence with similarity to lectins. This invention discloses that the putative 
lectin domains of GalNAc-transferase isoforms, GalNAc-T4, -T7, -T2, and -T3, are 
functional and recognize carbohydrates, glycopeptides, and peptides. These lectin domains 
have different binding specificities and modulate the functions of GalNAc-transferase 
isoforms differently. The lectin domains of GalNAc-T4 and -T7 directs the glycopeptide 
specfficities of these enzymes, while the lectin domains of GalNAc-T2 and -T3 directs 
broadening of donor substrate specificities to include UDP-Gal. Selective inhibitors of 
these functions that do not eliminate general catalytic activities are disclosed. Such 
inhibitors were free sugars, aryl-sugar derivatives, and glycopeptides. Inhibitors of 
GalNAc-transferase lectin domains are useful for altering the O-glycosylation capacity of 
cells, tissues, and organisms, and such compounds may also interfere with other binding 
functions involved in intracellular sorting and secretion events. 
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ABSTRACT 



Attachmenl of O-glycans lo proteins is controlled by a Urge 
family of homologous polypeptide GalNAc-transfcrascs. 
Polypeptide GalNAc-transfcrascs contain a C-tenninal 
sequence with similarity to lectins. This invention discloses 
that the putative lectin domains of GalNAc-transferase iso- ' 
forms, GalNAc-T4, -T7, -T2, and -T3, are functional and 
recognize carbohydrates, glycopcplides, and peptides and 
discloses the lectin domains of GalNAc-'ll-'116. "Inese 
lectin domains have different binding specificities and 
modulate the functions of GalNAc-transferase isoforms dif- 
ferently. Novel methods for identification of inhibitors or 
modulators of binding activities mediated by lectin domains 
of polypeptide GalNAc-transferases are disclosed. Direct 
binding activity of GalNAc-transfcrasc lectins has been 
demonstrated for the first time and methods to measure 
lectio mediated binding of isolated lectins or enzymes with 
lectio domains are disclosed. The present invention specifi- 
cally discloses a novel selective inhibitor of polypeptide 
GalNAc-transferase lectin domains, which provides a major 
advancement in that this inhibitor and related inhibitors 
sharing common characteristics of activity bind lectin 
domains without serving as acceptor substrate for glycosyl- 
transferases involved in synthesis of O-glycans. Tnis inhibi- 
tor is represented by the p-iinomeric configuration of Gal- 
NAc-bcnzyl, GalNAcP-bcnzyl. Methods for inhibiting 
intracellular transport, cell surface expression, and secretion 
of mucins and O -glycosylated glycoproteins without affect- 
ing O-glycosylation processing arc disclosed using the novel 
selective inhibitor identified. 
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